THE CORRELATION DECAY (CD) TREE AND STRONG SPATIAL MIXING IN 

MULTI-SPIN SYSTEMS 



CHANDRA NAIR AND PRASAD TETALI 

Abstract. This paper deals with the construction of a correlation decay tree (hypertree) for interacting 
systems modeled using graphs (hypergraphs) that can be used to compute the marginal probability of any 
vertex of interest. Local message passing equations have been used for some time to approximate the 
marginal probabilities in graphs but it is known that these equations are incorrect for graphs with loops. 
In this paper we construct, for any finite graph and a fixed vertex, a finite tree with appropriately defined 
boundary conditions so that the marginal probability on the tree at the vertex matches that on the graph. 
For several interacting systems, we show using our approach that if there is very strong spatial mixing on an 
infinite regular tree, then one has strong spatial mixing for any given graph with maximum degree bounded 
by that of the regular tree. Thus we identify the regular tree as the worst case graph, in a weak sense, for 
the notion of strong spatial mixing. 



1. Introduction 

In this paper we show that computation of the marginal probability for a vertex in a graphical model 
can be reduced to the computation of the marginal probability of the vertex in a rooted tree of self-avoiding 
walks, with appropriately defined boundary conditions. The computation tree approach for graphical models 
has been used by [Wei06a| . |BG06| , |GK07j . for the problems of independent sets, colorings and list-colorings. 
In [JS06] , the work of |Wei06aj for computing marginal probabilities was extended to inference problems in 
general two spins models. Our work builds on [Wei06a| IJS06] , and demonstrates how the computation tree 
can be extended to more than two spins and also for more than two-body interactions. This leads to a 
different tree (the correlation decay tree), which in a sense is more natural than the dynamic programming 
based tree of |GK07j for the case of multiple spins. Further, this approach also yields a tree for the case of 
multi-spin interactions with multiple spins. 

A practical motivation for the creation of a tree structure is the following. The feasible algorithms for 
computation of marginal probabilities in large interacting systems are constrained to be distributed and 
local. This requirement has given rise to message passing algorithms (like belief propagation) for systems 
modeled using graphs. Unfortunately, these algorithms do not necessarily give the correct answer for graphs 
with many loops, and may not even converge. However, for a tree it is known that the equations are exact 
and the marginal probability at the root can be computed in a single iteration by starting from the leaves. 
Thus, if for any graph one can show the existence of a tree, that respects the locality, in which the same 
marginal probability results, then one can use the exactness of the message passing algorithms on a tree 
to obtain a convergent, distributed, local algorithm for the computation of marginal probabilities on the 
original graph. 

The caveat with this approach is that the size of the tree can be exponentially large compared to the 
original graph. So even though the computations are exact, they may not be efficient in practice. However, 
for certain interesting counting problems [Wci06a| IGK07| lBGK + 06] approximation algorithms have been 
designed using the notion of spatial correlation decay, where the influence of the boundary at a root decays 
as the spatial distance between the boundary and the root increases. Hence pruning the tree to an efficiently 
computable neighborhood usually yields good and efficient approximations. Thus, to design efficient algo- 
rithms it would be useful to show some kind of decay of correlation in the tree structure that is presented 
here (and hence the name correlation decay tree). 

The second part of this paper addresses this issue of spatial correlation decay. We show that, for lots of 
systems of interest, if there is "very strong spatial mixing" in the infinite regular tree of degree D, then there 
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also exists "strong spatial mixing" for any graph with maximum degree D. So, in a loose sense, the infinite 
regular tree is indeed a worst case graph for correlation decay. The fact that some form of strong spatial 
mixing in the infinite regular tree should imply strong spatial mixing in graphs for a general multi-spin 
system was conjectured by E. Mosscl, [Mos07] . (In the case of independent sets and colorings, the infinite 
tree being the worst case for the onset of multiple Gibbs measures was conjectured by A. Sokal [SokOOj .) 

In the next section, we prove the generalization of the result in Wci06aJ to the case of multiple-spins but 
still restricting ourselves to two-body (pairwise) interactions. 

2. Preliminaries 

Consider a finite spin system with pairwise interactions, and modeled as a graph, G = (V,E). Let the 
partition function of this spin system be denoted by 

Z G = ^2 II ^i,j{x l ,x )W_(t)i{xi). 

f£X" {i,j)£G iev 

Let A C [n] be a subset of frozen vertices (i.e. vertices whose spin values are fixed) and let 
We wish to compute the following marginal probability with respect to the Gibbs measure, 

(2.i) p G ( X1 =a\x A )=^ x Y. n *"<,;'•'•<••'•./; n 

G 1]='- (iJ)eG iev 

xf£X A 

Instead of performing this marginal probability computation in the original graph G we shall create a corre- 
lation decay (CD) tree, 7a, on which the same marginal probability results by performing the computation 
as described in Section |2"T21 

2.1. The CD Tree. Similar tree constructions can be found for restricted classes of spin systems by |Wei06al 
IFS591 IGMP041 ISS051 IBG061 IJS06] . and in particular the one in [Wei06a| . Our starting point of the tree is 
the same as in [WeiOGa , i.e. wc begin by labeling the edges of the graph; draw the tree of self avoiding walks, 
T saw ; and include the vertices that close a cycle. In |Wei06a| . the vertices that close a cycle were denoted as 
occupied or unoccupied depending on whether the edge closing the cycle in T saw was larger than the edge 
beginning the cycle or not. 

Our main point of deviation from the construction in Wci06a is in the treatment of vertices that close 
the cycle that were appended in T saw . The vertices that close the cycle with higher numbered edges than 
those that begin the cycle (i.e. those that were marked occupied) are now constrained to take a particular 
spin value a q . The vertices that close the cycle with lower numbered edges (i.e. the unoccupied vertices) are 
constrained to take the same value as the occurrence of it earlier in the graph, i.e. the value of the vertex 
that begins the cycle. This constraint is denoted by a coupling line and influences the way the marginal 
probabilities are computed on the tree. The tree thus obtained is called the CD-tree, Tqd, associated with 
graph G. 

Definition 2.1. A coupling line on a rooted tree is a virtual line connecting a vertex u to some vertex v in 
the subtree below u. This line will play a role in the computation of the marginal probabilities as will be 
explained in detail later. In brief words, when one descends into the subtree of u to compute the marginal 
probability that u assumes a spin cr^, then the vertex v becomes frozen to o~i, the same as u. Thus, the spin 
to which v is frozen is coupled to the spin of u, whose marginal probability is being determined. 

Remark 2.2. One can easily make the following observations regarding coupling lines. A vertex can be the 
top end point of several coupling lines and indeed the number of coupling lines from any point is related to 
the number of cycles the vertex is part of in a certain subgraph of the original graph. A vertex can only be 
the bottom end point of a unique coupling line and for every such point, there is a unique twin point whose 
spin is frozen to o~ q , corresponding to traversing the cycle in the opposite direction. 
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2.2. Computation of marginal probabilities on the CD tree. Here we describe the algorithm for 
computing the marginal probability at the root for a tree with coupling lines. Let T be a rooted tree with 
frozen vertices A. In the tree presented in the previous section, the set A is also assumed to contain the 
vertices frozen to a q . Consider the recursion 

l00 , R a A( v _ M<*v) IT S=l ®v,u,(<Jv,CT l )R'^'(Ui = <Tj) 

At this step (proven by the next theorem) we will be computing the ratio of the probability that the root 
assumes a spin a v (with respect to the reference spin a q ), and therefore the lower end points of the coupling 
lines joined to the root to be frozen to a v . Thus the set of frozen vertices A gets appended with this subset of 
vertices; and the subset of this enhanced A that is in the subtree of the ith child is denoted as cta; . (There is 
an abuse of notation in that a ^ depends on the spin a v as A gets appended with the new vertices frozen by 
the dotted lines to a v .) One can use the above recursion to recursively compute the ratios for the correlation 
decay tree. The validity of this computation forms the basis of the next theorem. 

Remark 2.3. Consider a rooted tree with D denoting the maximum number of children for any vertex. Let 
C denote the computation time required for one step of the recursion in (|2.2[) . then it is clear that computing 
the probability at the root given the marginal probabilities at depth £ requires Q([{q — VjD] 1 ) time. The 
hidden constants in O depend on C and q. Observe that a bound for the computation time, tg, at depth £ 
can be obtained via the recursion t( < qC + [(q — l)D]tg-i. 

Note that whenever the tree visits a frozen vertex, the subtree under the frozen vertex can be pruned as 
this does not affect the computation. Similarly the subtree under a vertex that is also below the lower end 
of the virtual coupling line can be pruned. This leads to a subtree, T£ D , of Tcd- 

Example 2.4. We shall demonstrate this construction and computation using the following example graph 
with edges labeled in the usual lexicographic order. We shall retain the labeling of vertices on Tcd to reflect 
its origin from G but other than that they play no role in spin assignments and two similarly labeled vertices 
can have arbitrary spin assingmcnts in general. 




Let us assume that we are interested in computing the marginal probability of the vertex a for valid 
5-colorings of the graph G using the tree Tcd on the right. A coloring is valid if no two adjacent vertices 
are assigned the same color. For this interacting system a G {1, 2, 3, 4, 5} and 



$(a 4 ,(j,) 



1 if <jj ^ (ij 

if <7i = (Tj 



and the potential function (f>(ai) = 1. Let us assume that the vertices b, c are frozen to spins 2, 3 respectively 
and the reference spin a q — 4. It is easy to see using symmetry or explicit computation that a takes spins 
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Figure 2. The CD-tree Tq D and the subtree Td, pruned by the frozen vertices and coupling 
lines. The frozen colors are written adjacent to vertices in A. 



1,4,5 with probability 1/3 each, or in other words the ratios (with respect to color 4), -Rg(I) = Rg(^) — 
i?c(5) = 1. The pruned subtree Tq D can be drawn as in Figure^ 
Equation (|2.2j) gives 

(2 ' 3) ^V-r^D + RtAV + RtAsY 

where Td represents the subtree of T^ D under vertex d. The frozen subtrees Td for the four computations 
i?T d (l), RtAQi ^r d (4), i?T d (5) are represented in Figured 




Figure 3. The subtree Td(-) for the computations i?r d (l), i?T d (4), i?r d (4), i?r d (5), respec- 
tively. Note the spin of the new frozen vertex as forced by the coupling line in the four 
cases. 



The resultant subtrees Td(-) have the usual computation procedure (i.e. they do not have coupling lines); 
for example, the value i?T d (l) can be computed as 



RtAV 



( R Tc (2) + R Tc (3) + i? Tc (4) + R Tc (5) \ / R T) (2) + R Tf (3) + R Tf (4) + R T} (5) 



V R T , (1) + Rt s (2) + R Te (3) + R Te (5) J \R Tf {l) + Rt s (2) + R Tf (3) + R Tf (5) 



3,3,3, 3\ /3,3 +1+ 3 
4 ^ 4 ^ 4 T 4 W 4^4^ 1 ^4 



1+2+3,3 / \3,3,3,3 
1 ^4 T 4 T 4/ \4^4 T 4^4 



= 1. 



By symmetry to the previous computation Rx d (2) = RT d (5) = 1 and from the definition, i?T d (4) = 1. Thus 
from (|2.3[) one obtains 

R TCD (D= 1 + 1 + 1 =1. 

as desired. 

Remark 2.5. The next theorem and its proof is essentially the same as in |Wei06b| ; therefore we will use the 
same notation whenever possible and skip the details of similar arguments. 



Theorem 2.6. For every graph G = (V,E), every A C V, any configuration o~\, and all o~ v 

R G A (v = a v ) =R^ D (v = a v ), 

where Wj!^ n (v = o~ v ) stands for the ratio (with respect to the reference spin, say q) of the probability that the 
root v of Tqd has spin a v when the computation is performed as described above. The actual probabilities 
can be computed from the ratios by normalizing them such that the probabilities sum to one. 

Proof. Let a q be a fixed spin. Define the ratios 



rih°v) 



v 



PG A ( V = a <l) 



Let d be the degree of vertex v and let u%, 1 < i < d be its neighbors. If the graph G was indeed a tree T, 
then we can see that the following exact recursion 

( 0A s R a A , n _ <M<V) TT ELl S^KiO-Q-K^^i = m) 

M°q) fJi ELi ®v,u t (<rq, o-i)R Ti * ( Ui = at) 

would hold, where Ti is the subtree associated with the neighbor Ui obtained by removing the iih edge of v, 
and <TAi is the restriction of o~\ to A n Tj and appended with the new vertices frozen to o~ v corresponding to 
the lower endpoints of coupling lines originating from v. 

Fixing the vertex of interest v, define G' as the graph obtained by making d copies of the vertex v and 
each Vi having a single edge to In addition, the vertex potential <j) v {a v ) is re-defined to (f>l^ d (a v ). It is 
easy to see that the following two ratios are equal 

Pq A {v = O-y) _ Pg){vi = (Ty, —,Vd = °~v) 



Defining 



PG A ( v = cr i) P G A (vi=o- q ,...,v d = a q ) 

PG A ( V 1 = a ^ -■> V i = a v,Vi+l = Cr q , ..,V d = O-g) 



one sees that 



PcP(vi = a v , ...,Vi-\ = a v ,Vi = a q , ..,v d = a q ) 

d 

^ A K) = Il i? G> ! K)- 

i=l 

It is easy to see that R G AT y. (<r„) is the ratio of the probaility that the vertex Vi = o~ v to the probability of 
Vi = o~q, conditioned on o~\ and Ti, where Tj denotes the configuration where vertices Vi, ...,Vi—i are frozen 
to a v and vertices Uj+i, 1)4 are frozen to a q . 

In G', the vertex Vi is only connected to Ui\ and let G' \ Vi denote the connected component of G' that 
contains Ui after the removal of the edge (i>i, ui). Therefore 

G '' vMv) 4>l /d (a q ) ELi ®v,uM q ,ai)R&\ Vi {ui = *i) ' 

and hence 

Observe that the recursion p.5p terminates since at each step the number of unfixed vertices reduces by one. 



Remark 2.7. Observe that the equation in (J275J) is similar to the one for the tree (|2.4j) . This similarity will 
help us identify the recursion (|2.5p to be exactly the same one in Tqd with the condition corresponding 
to <7a along with the coupling of the values of vertices that was used in its definition. The key difference 
between the binary spin model in [Wei06aj and this proof also lies here; that in the binary spin model one 
of the spins was always the reference spin and the other was the subject of the recursion. Thus the coupling 
of the spin to its parent in Tqd was implicit. 
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From the similarity of (|2.5[) and (|2.4p , one can use induction to complete the proof provided that the 
graph G \ Vi with the condition corresponding to a\Ti leads to the same subtree of Tqd corresponding to 
the i-th child of the original root with the condition corresponding to o"A; ■ It is easy to observe that the two 
trees are the same - both are paths in G starting at Ui, and copies of v are set to a v if it is reached via a 
smaller numbered edge and set to a q else. The above observation along with the fact that the stopping rules 
coincide for the two recursions completes the proof of Theorem 12.61 using induction. 

□ 

2.3. Multi-spin interactions. In this section, we extend the results of the previous section from pairwise 
interactions to multi-spin interactions. The underlying model can be depicted by a hypergraph with the 
hyperedges denoting the vertices involved in an interaction. 

Consider a finite spin system whose interactions can be modeled as a hypergraph, G = (V,E). Let the 
partition function of this spin system be denoted by 

Z G = J2 ~[[$e(Xe)~[[<t>i(Xi)- 

SeX n e£E iev 

As before, let A C [n] be a subset of frozen vertices (i.e. vertices whose spin values are fixed) and let 

Z G= E n*«(*e)II 

xf£X A eeE iev 

We wish to compute the following marginal probability with respect to the Gibbs measure, 
(2.6) P G ( Xl = a\X A ) = LI II 

G as 1 =<7, e eE ieV 
x<£X A 

2.4. CD hypertrees on hypergraphs. The motivation for the following hypertree essentially comes from 
the proof of the CD tree in the previous section. Let the n vertices in V be numbered in some fixed order, 
1, n. The tree is constructed in a top down approach just as the tree of self avoiding walks. 

The procedure described below is similar to a generalization of the tree of self avoiding walks for graphs. 
For ease of exposition we will keep describe the construction using the following example. Let V = 
{1,2,3,4,5} and let the hyperedges be {(1, 2, 3), (1, 2, 5), (1, 3, 4), (2, 5, 4)}. Let us assume that vertex 1 
is the root. From G construct the graph G\ with vertex 1 replicated thrice (equal to its degree) to l a , lj,, l c . 
Let the resulting hyperedges be {(l a , 2, 3), (lb, 2, 5), (l c , 3, 4), (2, 5, 4)}. Observe that, 

P G (X1 = O"! ) _ ggifola = gllgjj: = a li X lc = Qi) 

Pg{xi = <T ) ^bi0ci o = <ro,Xi h = <tq,xi c = a ) 

_ PgAxi* = Vl\xi b = VQiX1 c = op) P Gl (xi b = Q\ \xi a = Ol , Xi c = CTp) 

p Gi{x\ a = o~o\xi b = a ,xi c = cr ) PgAxu = cr \xi a = ai,xi c = a a ) 

PgAxi c = &l\xi a = (Ti,Xi b = ai) 
P G l(xi c = <To\X! a = <7l,Xl b = CTl) ' 

Now consider a graph H where vertex 1 has degree three and such that the removal of vertex 1 and 
the three hyperedges, disconnects the graph into 3 disconnected components. The first component, H\, 
contains the set of vertices {2' 1 ', 3'- 1 -', 4^', 5^ , lj, 1 '' , li 1 ''}, with the vertices lP^ and lc frozen to have spin 
(To- The hyperedges that form part of this component (along with the root) are {(1, 3^), (li , 2^ l \ 5^^), 
(U 1) ,3( 1 ),4( 1 )),(2( 1 ),5( 1 ),4( 1 ))}. 

The second component, H 2 , contains the set of vertices {2^ 2 \ 3^ 2 \ 5^ 2 \ lc 2 "*}, with the ver- 

(2) . (2) 

tex la frozen to have spin o~\ and the vertex lc frozen to have spin (To; and the hyperedges being 
{(l a 2) ,2( 2 \3^), (1,2( 2 \5( 2 >), (li 2) ,3^,4( 2 )),(2( 2 ),5( 2 ),4( 2 ))}. Finally, the third component, H 3 , contains 
the set of vertices {2^\ 3 (3 \ 4^ 3 \ 5^, li 3) , l[ 3) }; the vertices li 3) and if } frozen to have spin o~\ ; and hyper- 
edges {(li 3) , 2( 3 ), 3< 3 >), (l^, 2( 3 ), 5( 3 )), (1, 3( 3 ), 4( 3 )), (2( 3 ), 5( 3 ), 4( 3 ))}. 
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It is clear that the following holds, 

P H {xi = o-i ) _ PhJjci = Q-i ) Ph 2 (xi = o-i ) Ph 3 (xi = en) 
Ph{x\ = (j ) PhA x i = o-q) Ph 2 (xi = ct ) Ph s (xi = a ) 

_ PgA^U = &l\xi b = <TO,Xl c = Co) PgA^U = <7l\Xl a = <7l,Xi e = (To) 

X 



P Gi(xi a = Co\xi b = <TQ,Xl c = Go) PgA X U = °~o\ X la = °~li x lc = °"o) 
PGAXl c = (Tl\xi a = (Tl,X lb = 0-1 ) 
PgA x 1c = °~o\ x l a =(Tl,Xl b =0-i)' 

= P G {xi=°i) 
Pg{xi = a ) ' 

Further, this general procedure for separating the children of the root can now be performed iteratively on 
each of its children to yield a CD hypertree, Hcd, in the same way as one generates the CD tree for pairwisc 
interactions. Since at each stage, the number of unfrozen vertices reduces by one, the procedure terminates 
yielding a hypertree with the degree of every vertex bounded by its degree in the original hypergraph. This 
leads to the following result for the case of hypergraphs, 

Theorem 2.8. For every hypergraph G = (V,E), every A C V, any configuration a\, and all o~ v 

R G A (v = a v )=R^ OD (v = a v ), 

where W^ cd (v — o~ v ) stands for the ratio (with respect to the reference spin, say o~o) of the probability that the 
root V of Hcd has spin o~ v when computations are performed as described previouly. The actual probabilities 
can be computed from the ratios by normalizing them such that the probabilities sum to one. 

3. Spatial mixing and Infinite regular trees 

In this section, we study spatial mixing and demonstrate sufficient conditions for spatial mixing to exist 
for all graphs G with maximum degree 6+1 in terms of spatial mixing conditions on the infinite regular 
tree, T b , of degree 6+1. We review the concept of strong spatial mixing that was considered in |Wei06aj 
and prove one of our main results. 

Definition 3.1. Let <5 : N — > R + be a function that decays to zero as n tends to infinity. The distribution 
over the spin system depicted by G = {V, E) exhibits strong spatial mixing with rate 5{-) if and only if for 
every spin a±, every vertex v £ V and A C V and any two spin configurations, a\,T\, on the frozen spins, 
we have 

\p(v = <7i|JC A = cr A ) -p(v = a\\X A = r A )| < <J(dist(u, A)), 
where A C A stands for the subset in which the frozen spins differ. 

Let T denote a rooted tree. We say that a collection L of virtual edges is a set of valid coupling lines, 
if they satisfy the following constraints: a coupling line joins a vertex to some vertex in the subtree under 
it; the lower endpoints of the coupling lines are unique; no pair of coupling lines form a nested pair or an 
interleaved pair, i.e. the endpoints do not lie on a single path. 

Remark 3.2. Observe that the pruned CD tree, Tq D , is a tree with a set of valid coupling lines. The pruned 
CD tree also has the property that the end points of coupling lines have a corresponding twin leaf that is 
frozen to a q , but we have not imposed that requirement above. It is possible that enforcing that requirement 
and thus limiting the set of valid coupling lines may strengthen the results, but we omit it here for ease of 
exposition. 

Definition 3.3. Let T denote a rooted tree. Let S : N — > IR + be a function that decays to zero as n tends to 
infinity. The distribution over the spin system at the root, v, of T exhibits very strong spatial mixing with 
rate 5{-) if and only if for every spin o~\, every set of valid coupling lines, for every A C V and any two spin 
configurations, o~\, t\, on the frozen spins, we have 

p T (v = <ji\Xa = cta) ~Pt(v = o~i\Xa = ta) < <5(dist(w,A)), 

where A C A stands for the subset in which the frozen spins differ. The computations of the marginal 
probability on this tree with coupling lines is performed as described in Section [2.21 
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Remark 3.4. From the recursions observe that the computation tree can be pruned at any frozen vertex or 
at any lower endpoint of a coupling line. 

Remark 3.5. It is clear that very strong spatial mixing reduces to strong spatial mixing in the absence of 
coupling lines. Thus very strong spatial mixing on a tree implies strong spatial mixing with the same rate 
on the tree. 

The main result of this section is that very strong spatial mixing on the infinite regular tree with degree 
6+1 implies strong spatial mixing on any graph with degree 6+1. Wc will distinguish between two cases 
of neighboring interactions: 

(i) Spatially invariant interactions $(■, ■) > and potentials </>(•) > where the interaction matrix 4>(-, •) 

satisfies the positively alignable condition stated below. 
(ii) General spatially invariant interactions $(■, ■) > and potentials </)(•) > that need not satisfy the 
positively alignable condition. 

Definition 3.6. A matrix $(■, •) is said to be positively alignable if there exists a non-negative vector a(-) 
such that the column vectors of the matrix $ can be aligned in the [l...l] T direction, i.e. = [ll...l] T . 
Alternately, the vector [l...l] T belongs to the convex cone of the column vectors of 

Note that a sufficient condition for <f> to be positively alignable is the existence of a (permissive spin) <jq 
which satisfies the following property: $(0+ Co) = c\ > for all spins cr^, and </>(co) = C2 > (e.g. the 
"unoccupied" spin in independent sets). 

Remark 3.7. We will state the next theorem for Case (i), and a similar theorem (see Section |3~2"|) will hold 
for the other case. The reason for separating the two cases is that in Case (i) one can stay within the same 
spin space in the infinite tree T b , to verify very strong spatial mixing. 

3.1. Interactions that are positively alignable. 

Theorem 3.8. For every positive integer b and fixed $(•, ■),(/){■) such that $ is positively alignable, if T b 
exhibits very strong spatial mixing with rate 5 then every graph with maximum degree 6+1 and having the 
same $(•,•), <f>(-) exhibits strong spatial mixing with rate 5. 

Proof. The proof of this theorem follows in a straightforward manner from Theorem l2.6l If T A is the tree in 
Section [27X1 rooted at v, i.e. Tqd adapted to A, then Theorem 12.61 implies that 

(3.1) p G (v = <Ti|A A = cr A ) ~Pg(v = <ti\X a = t a ) = Pt a (v = (Ti\X A = a A ) - pt a { v = ^i\ x A = t a) • 

Further note that for any subset A of vertices of G, dist(v,A) is equal to the distance between the root v 
and the subset of vertices of T A composed of the copies of vertices in A as the paths in T A correspond to 
paths in G. To complete the proof we need to move from T A to T b . 

Note that $ is positively alignable is equivalent to the existence of a probability vector a(-) such that 

(3.2) 5^*(oj,(Ti)0(o-i)o(o-i) =ci > 0, Va ; . 

i 

As every vertex in Ta has at most the degree of the vertex in G, one can view T A as a subgraph of T b . 
(As before A is also assumed to contain the vertices that are frozen to a q by the construction.) Let d(T\) 
represent the non-fixed boundary vertices, i.e. vertices in Ta that are not fixed by A, are not the lower end 
points of a dotted line, and have degree strictly less than 6+1. Let Ai denote the set of vertices: in T b \ T\ 
that is attached to one of the vertices in 9(Ta). Append Ai to Ta to yield a subtree, T A of T b . Choose the 
spins for the vertices in Ai independently, distributed proportional to (f>(-)a(-). 
We claim that 

Pt a (v = o-i|A A = <7 A ) =pfb A {v = ai\X A = cr A ). 
This follows from the observation that for all Ui in Ai we have 



Z)Ll^,fi( Cr 9' Cr ')- R T i i ( U i = a l) 



where (a) follows from (|3.2[) . Thus the recursions in T A becomes identical to the ones in T\. 
Now from the very strong spatial mixing property that T 6 is assumed to possess, we have 



Pt a (v = ai\X A = c a ) - Pt a (v = a-i \X A = ta) 

= p fb ^ (v = <ri\X A = <t a ) ~Pfb(v = <ti\X a = r A ) < 5(dist(u, A)). 

The above equation along with (|3.ip completes the proof. □ 

Corollary 3.9. Very strong spatial mixing on T b ( with positively alignable $ ) implies a unique Gibbs measure 
on all graphs with maximum degree b + 1 . 



Proof. From Theorem l3.8[ very strong spatial mixing on T b with positively alignable $ implies strong spatial 
mixing on graphs with maximum degree 6+1. Since strong spatial mixing is a sufficient condition for the 
existence of a unique Gibbs measure on all graphs with maximum degree 6+1, the result follows. □ 

3.2. General Interactions. Consider the scenario of general interactions. Define an extra (permissive) 
spin Co that satisfies the following property: $(<to,<t;) = C2 > 0,(j)(o~o) = C3 > 0. If there is very strong 
spatial mixing on the infinite tree with this extra spin o~q then the following analogue of Theorem 13.81 holds. 

Theorem 3.10. For every positive integer 6, if T b (with the extra spin <7o ) exhibits very strong spatial mixing 
with rate 8 then every graph with maximum degree 6+1 and having the same $(•, •), <f>(-) exhibits very spatial 
mixing with rate 6. 

Proof. The proof is similar to that of Theorem 13.81 except for the following changes. Fix the spins of the 
vertices in Ai to <jq instead of generating them independently with probability a(-). Condition also on the 
event that none of the sites in Ta are assigned the extra spin <xo. With these two changes made, the proof 
of Theorem 13.81 carries over and hence is not repeated. □ 



3.3. On very strong spatial mixing on trees. The idea of very strong spatial mixing is different from 
the standard notions of spatial mixing due to the introduction of coupling lines. However, it is key to note 
that these coupling lines behave similarly in configurations a A and ta and thus conceptually it is similar to 
strong spatial mixing where vertices close to the root are allowed to be frozen to identical spins in both o~ A 
and ta- However the fact that the actual computations involve spins to be frozen to different values may 
lead to a strictly stronger condition than strong spatial mixing. In some sense, this condition demands that 
the difference of marginal probabilities depend only on the spatial locations of the frozen vertices and not 
on the spins that these vertices assume, reminiscent of uniform convergence in analysis. 

One sufficient condition for very strong spatial mixing is the existence of a Lipschitz contraction for 
probabilities or log-likelihoods, as in [BN061 IBGK+06] , In general if one can show that some continuous 
monotone function /(j» CTA (<?„)), where Pa^v) is computed using the recursions in (|2.2p from the probabilities 
of its children {Pi A {<Ji)}, satisfies 

|/(p CTA K)) - f(p T H*v))\ < JfmaK|/(pf A fa)) - /(pTVO)I 

for some K < 1, then one can show that this implies very strong spatial mixing (indeed with an exponential 
rate). 

4. Algorithmic implications 

The idea of strong spatial mixing, combined with an exponential decay of correlation, has been used 
recently in |Wei06a[ IGK071 |BGK + Q6] to derive polynomial time approximation algorithms for counting 
problems like independent sets, list colorings and matchings. Traditionally these counting problems were 
approximated using Markov chain Monte Carlo (MCMC) methods yielding randomized approximation algo- 
rithms. In contrast the new techniques based on spatial correlation decay yield deterministic approximation 
algorithms, thus providing a new alternative to MCMC techniques. 



Definition 4.1. A pairwise interacting system ($(•,•),</>(■)) is said to have an exponential strong spatial 
correlation decay if an infinite regular tree of degree D, rooted at v, has a very strong spatial mixing rate, 
(5(dist(w, A)) < e -«odist(t,,A) for SQmc Kd > q 



From the previous two sections, we will see that the marginal probabilities (and thus the partition function) 
for any pairwise interacting system with finite spins with an exponential strong spatial correlation decay, 
whose interactions can be modeled as a graph G with bounded degree, can be approximated efficiently. 

Lemma 4.2. Consider a graph G of bounded degree, say D, denoting the interactions of a pairwise inter- 
action system with exponential strong spatial correlation decay. Then the marginal probability of any vertex 

v can be approximated to within a factor (1 ± e), for e = , in a polynomial time given by Q(n K o og£) ). 

Proof. From the definition of strong spatial mixing rate it is clear that the marginal probability at the root 
can be approximated to a (1 + e) factor, provided dist(t>, A) > — =: £. That is, for any initial assignment 
of marginal probabilities to lead nodes at depth / from the root, the recursions in (|2.4p would give a (1 + e) 
approximation to the true marginal probability. 

Let C denote the computation time required for one step of the recursion in (|2.4p . then it is clear that 
computing the probability at the root given the marginal probabilities at depth £ requires 0([(g — l)D] e ) 
time. The hidden constants in O depend on C and q. Observe that a bound for the computation time, te, 
at depth £ can be obtained via the recursion tt < qC + (q — l)Dt(.-\. 

Therefore, if one wishes to obtain an e = approximation, then the computational complexity would 

be <d(n K D log ' 9 X ^ D ). Thus, the marginal probability as well as the partition function can be approximated 
in polynomial time. □ 

Remark 4.3. It is well known that the partition function can be computed as a telescopic product of marginal 
probabilities (of smaller and smaller systems) and thus an efficient procedure for yielding the marginal 
probabilities also yields an efficient procedure (usually time gets multiplied by n and the error gets magnified 
by n) for computing the partition function. 

5. Remarks and conclusion 

On colorings in graphs: Consider the anti- ferromagnetic hard-core Potts model with q spins, or equivalently, 
consider the vertex coloring of graph G with q colors. It is conjectured that for any infinite graph with 
maximum degree D (and with appropriate vertex transitivity assumptions, so that the notion of Gibbs 
measures make sense), one can show that this system has a unique Gibbs measure as long as q is at least 
D + l. Using the results in the previous sections, if one establishes that the infinite regular tree with degree 
D has very strong spatial mixing when q is at least D + l, then this will imply that any graph with maximum 
degree D will also have very strong spatial mixing and thus a unique Gibbs measure. 

It is known from |Jon02| that the infinite regular tree with degree D has weak spatial mixing when the 
number of colors is at least D + l. The nature of the correlation decay suggests that very strong spatial 
mixing should also hold in this instance. However, it is not clear to the authors that the proof can be 
modified to provide an argument for very strong spatial mixing (or even whether the proof can be extended 
to show weak spatial mixing for irregular trees with maximum degree D). Another possible approach that 
is yet to be explored completely is whether weak spatial mixing and some monotonicity arguments, like in 
|Wei06aj for independent sets, will directly imply very strong spatial mixing. 

Conclusion: We have shown the existence of a computation tree in graphical models that compute the exact 
marginal probabilities in any graph. Further we have shown that from the point of view of very strong spatial 
mixing, a notion of spatial correlation decay, the infinite regular tree is a worst-case graph. So proving results 
on infinite regular trees would immediately imply similar results for graphs with bounded degree. 
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